RAW SEQUENCE LISTING
PATENT APPLICATION: US/09/462,480

DATE: 12/29/2000
TIME: 11:58:22

Input Set: A:\ES.txt
Output Set: N:\CRF3\12292000\I462480.raw

TECH CENTER 1600/2900

  3 <110> APPLICANT: GICQUEL, BRIGITTE
  4       BERTHET, FRANCOIS-XAVIER
  5       ANDERSEN, PETER
  6       RASMUSSEN, PETER BIRK

  8 <120> TITLE OF INVENTION: POLYNUCLEOTIDE FUNCTIONALLY CODING FOR THE LHP PROTEIN FROM MYCOBACTERIUM
  9       TUBERCULOSIS, ITS BIOLOGICALLY ACTIVE DERIVATIVE FRAGMENTS, AS WELL AS METHODS
 10       USING THE SAME

 12 <130> FILE REFERENCE: 0660-0165-0XPCT

 14 <140> CURRENT APPLICATION NUMBER: 09/462,480
 15 <141> CURRENT FILING DATE: 2000-03-06

 17 <150> PRIOR APPLICATION NUMBER: PCT/IB98/01091
 18 <151> PRIOR FILING DATE: 1998-07-16

 20 <150> PRIOR APPLICATION NUMBER: 60/052,631
 21 <151> PRIOR FILING DATE: 1997-07-16

 23 <160> NUMBER OF SEQ ID NOS: 34

 25 <170> SOFTWARE: PatentIn version 3.0

 27 <210> SEQ ID NO: 1
 28 <211> LENGTH: 1277
 29 <212> TYPE: DNA
 30 <213> ORGANISM: Mycobacterium tuberculosis

 32 <400> SEQUENCE: 1
 33 ctgcagcagg tgacgtcgtt gttcagccag gtgggcggca ccggcggcgg caacccagcc  60

 35 gacgaggaag ccgcgcagat gggcctgctc ggcaccagtc cgctgtcgaa ccatccgctg  120

 37 gctggtggat caggccccag cgcgggcgcg ggcctgctgc gcgcggagtc gctacctggc  180

 39 gcaggtgggt cgttgacccg cacgccgctg atgtctcagc tgatcgaaaa gccggttgcc  240

 41 ccctcggtga tgccggcggc tgttgccgga tcgtcggtga cgggtggcgc cgctccggtg  300

 43 ggtccgggag cgatgggcca gggttcgcaa tccggcggct ccaccagccc gggtctggtc  360

 45 gcgccggcac cgctcgcgca ggagcgtgaa gaagacgacg aggacgactg ggacgaagag  420

 47 gacgactggt gagctcccgt aatgacaaca gacttcccgg ccacccgggc cggaagactt  480

 49 gccaacattt tggcgaggaa ggtaaagaga gaaagtagtc cagcatggca gagatgaaga  540

 51 ccgatgccgc taccctcggg caggaggcag gtaatttcga gcggatctcc ggcgacctga  600

 53 aaacccagat cgaccaggtg gagtcgacgg caggttcgtt gcagggccag tggcgcggcg  660

 55 cggcggggac ggccgcccag gccgcggtgg tgcgcttcca agaagcagcc aataagcaga  720

 57 agcaggaact cgacgagatc tcgacgaata ttcgtcaggc cggcgtccaa tactcgaggg  780

 59 ccgacgagga gcagcagcag gcgctgtcct cgcaaatggg cttctgaccc gctaatacga  840

 61 aaagaaacgg agcaaaaaca tgacagagca gcagtggaat ttcgcgggta tcgaggccgc  900

 63 ggcaagcgca atccagggaa atgtcacgtc cattcattcc ctccttgacg aggggaagca  960

 65 gtccctgacc aagctcgcag cggcctgggg cggtagcggt tcggaggcgt accagggtgt  1020

 67 ccagcaaaaa tgggacgcca cggctaccga gctgaacaac gcgctgcaga acctggcgcg  1080

 69 gacgatcagc gaagccggtc aggcaatggc ttcgaccgaa ggcaacgtca ctgggatgtt  1140

 71 cgcatagggc aacgccgagt tcgcgtagaa tagcgaaaca cgggatcggg cgagttcgac  1200

 73 cttccgtcgg tctcgccctt tctcgtgttt atacgtttga gcgcactctg agaggttgtc  1260

 75 atggcggccg actacga  1277

 78 <210> SEQ ID NO: 2
 79 <211> LENGTH: 524
 80 <212> TYPE: DNA
 81 <213> ORGANISM: Mycobacterium tuberculosis

RECEIVED
JAN 10 2001
TECH CENTER 1600/2900

 83 <400> SEQUENCE: 2
 84 ctgcagcagg tgacgtcgtt gttcagccag gtgggcggca ccggcggcgg caacccagcc  60

 86 gacgaggaag ccgcgcagat gggcctgctc ggcaccagtc cgctgtcgaa ccatccgctg  120

 88 gctggtggat caggccccag cgcgggcgcg ggcctgctgc gcgcggagtc gctacctggc  180

 90 gcaggtgggt cgttgacccg cacgccgctg atgtctcagc tgatcgaaaa gccggttgcc  240

 92 ccctcggtga tgccggcggc tgttgccgga tcgtcggtga cgggtggcgc cgctccggtg  300

 94 ggtccgggag cgatgggcca gggttcgcaa tccggcggct ccaccagccc gggtctggtc  360

 96 gcgccggcac cgctcgcgca ggagcgtgaa gaagacgacg aggacgactg ggacgaagag  420

 98 gacgactggt gagctcccgt aatgacaaca gacttcccgg ccacccgggc cggaagactt  480

100 gccaacattt tggcgaggaa ggtaaagaga gaaagtagtc cagc  524

103 <210> SEQ ID NO: 3
104 <211> LENGTH: 481
105 <212> TYPE: DNA
106 <213> ORGANISM: Mycobacterium tuberculosis

108 <400> SEQUENCE: 3
109 ctgcagcagg tgacgtcgtt gttcagccag gtgggcggca ccggcggcgg caacccagcc  60

111 gacgaggaag ccgcgcagat gggcctgctc ggcaccagtc cgctgtcgaa ccatccgctg  120

113 gctggtggat caggccccag cgcgggcgcg ggcctgctgc gcgcggagtc gctacctggc  180

115 gcaggtgggt cgttgacccg cacgccgctg atgtctcagc tgatcgaaaa gccggttgcc  240

117 ccctcggtga tgccggcggc tgttgccgga tcgtcggtga cgggtggcgc cgctccggtg  300

119 ggtccgggag cgatgggcca gggttcgcaa tccggcggct ccaccagccc gggtctggtc  360

121 gcgccggcac cgctcgcgca ggagcgtgaa gaagacgacg aggacgactg ggacgaagag  420

123 gacgactggt gagctcccgt aatgacaaca gacttcccgg ccacccgggc cggaagactt  480

125 g  481

128 <210> SEQ ID NO: 4
129 <211> LENGTH: 302
130 <212> TYPE: DNA
131 <213> ORGANISM: Mycobacterium tuberculosis

133 <400> SEQUENCE: 4
134 atggcagaga tgaagaccga tgccgctacc ctcgggcagg aggcaggtaa tttcgagcgg  60

136 atctccggcg acctgaaaac ccagatcgac caggtggagt cgacggcagg ttcgttgcag  120

138 ggccagtggc gcggcgcggc ggggacggcc gcccaggccg cggtggtgcg cttccaagaa  180

140 gcagccaata agcagaagca ggaactcgac gagatctcga cgaatattcg tcaggccggc  240

142 gtccaatact cgagggccga cgaggagcag cagcaggcgc tgtcctcgca aatgggcttc  300

144 tg  302

147 <210> SEQ ID NO: 5
148 <211> LENGTH: 100
149 <212> TYPE: PRT
150 <213> ORGANISM: Mycobacterium tuberculosis

152 <400> SEQUENCE: 5

154 Met Ala Glu Met Lys Thr Asp Ala Ala Thr Leu Gly Gln Glu Ala Gly
155 1               5                   10                  15

157 Asn Phe Glu Arg Ile Ser Gly Asp Leu Lys Thr Gln Ile Asp Gln Val
158             20                  25                  30

160 Glu Ser Thr Ala Gly Ser Leu Gln Gly Gln Trp Arg Gly Ala Ala Gly
161         35                  40                  45

163 Thr Ala Ala Gln Ala Ala Val Val Arg Phe Gln Glu Ala Ala Asn Lys
164     50                  55                  60

166 Gln Lys Gln Glu Leu Asp Glu Ile Ser Thr Asn Ile Arg Gln Ala Gly
167 65                  70                  75                  80

169 Val Gln Tyr Ser Arg Ala Asp Glu Glu Gln Gln Gln Ala Leu Ser Ser
170                 85                  90                  95

172 Gln Met Gly Phe
173             100

175 <210> SEQ ID NO: 6
176 <211> LENGTH: 49
177 <212> TYPE: PRT
178 <213> ORGANISM: Mycobacterium tuberculosis

180 <400> SEQUENCE: 6

182 Met Ala Glu Met Lys Thr Asp Ala Ala Thr Leu Gly Gln Glu Ala Gly
183 1               5                   10                  15

185 Asn Phe Glu Arg Ile Ser Gly Asp Leu Lys Thr Gln Ile Asp Gln Val
186             20                  25                  30

188 Glu Ser Thr Ala Gly Ser Leu Gln Gly Gln Trp Arg Gly Ala Ala Gly
189         35                  40                  45

191 Thr

194 <210> SEQ ID NO: 7
195 <211> LENGTH: 42
196 <212> TYPE: PRT
197 <213> ORGANISM: Mycobacterium tuberculosis

199 <400> SEQUENCE: 7

201 Gln Glu Ala Ala Asn Lys Gln Lys Gln Glu Leu Asp Gly Ile Ser Thr
202 1               5                   10                  15

204 Asn Ile Arg Gln Ala Gly Val Gln Tyr Ser Arg Ala Asp Glu Glu Gln
205             20                  25                  30

207 Gln Gln Ala Leu Ser Ser Gln Met Gly Phe
208         35                  40

210 <210> SEQ ID NO: 8
211 <211> LENGTH: 21
212 <212> TYPE: PRT
213 <213> ORGANISM: Mycobacterium tuberculosis

215 <400> SEQUENCE: 8

217 Gln Glu Ala Gly Asn Phe Glu Arg Ile Ser Gly Asp Leu Lys Tyr Thr
218 1               5                   10                  15

220 Gln Ile Asp Gln Val
221             20

223 <210> SEQ ID NO: 9
224 <211> LENGTH: 16
225 <212> TYPE: PRT
226 <213> ORGANISM: Mycobacterium tuberculosis

228 <400> SEQUENCE: 9

230 Gly Asp Leu Lys Thr Gln Ile Asp Gln Val Glu Ser Thr Ala Gly Ser
231 1               5                   10                  15

233 <210> SEQ ID NO: 10
234 <211> LENGTH: 16
235 <212> TYPE: PRT
236 <213> ORGANISM: Mycobacterium tuberculosis

238 <400> SEQUENCE: 10

240 Gly Ser Leu Glu Gly Gln Trp Arg Gly Ala Ala Gly Thr Ala Ala Ala
241 1               5                   10                  15

243 <210> SEQ ID NO: 11
244 <211> LENGTH: 16
245 <212> TYPE: PRT
246 <213> ORGANISM: Mycobacterium tuberculosis

248 <400> SEQUENCE: 11

250 Gln Glu Ala Ala Asn Lys Glu Lys Gln Glu Leu Asp Glu Ile Ser Thr
251 1               5                   10                  15

253 <210> SEQ ID NO: 12
254 <211> LENGTH: 28
255 <212> TYPE: PRT
256 <213> ORGANISM: Mycobacterium tuberculosis

258 <400> SEQUENCE: 12

260 Ser Thr Asn Ile Arg Gln Ala Gly Val Gln Tyr Ser Arg Ala Asp Glu
261 1               5                   10                  15

263 Glu Gln Gln Gln Ala Leu Ser Ser Gln Met Gly Phe
264             20                  25

266 <210> SEQ ID NO: 13
267 <211> LENGTH: 16
268 <212> TYPE: PRT
269 <213> ORGANISM: Mycobacterium tuberculosis

271 <400> SEQUENCE: 13

273 Arg Ala Asp Glu Glu Gln Gln Gln Ala Leu Ser Ser Gln Met Gly Phe
274 1               5                   10                  15

276 <210> SEQ ID NO: 14
277 <211> LENGTH: 21
278 <212> TYPE: DNA
279 <213> ORGANISM: Artificial/Unknown

281 <220> FEATURE:
282 <221> NAME/KEY: misc_feature
283 <222> LOCATION: ()..()
284 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

287 <400> SEQUENCE: 14
288 ctgcagcagg tgacgtcgtt g  21

291 <210> SEQ ID NO: 15
292 <211> LENGTH: 23
293 <212> TYPE: DNA
294 <213> ORGANISM: Artificial/Unknown

296 <220> FEATURE:
297 <221> NAME/KEY: misc_feature
298 <222> LOCATION: ()..()
299 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

303 <400> SEQUENCE: 15
304 ccgggtggcc gggaagtctg tgt  23

307 <210> SEQ ID NO: 16
308 <211> LENGTH: 23
309 <212> TYPE: DNA
310 <213> ORGANISM: Artificial/Unknown

312 <220> FEATURE:
313 <221> NAME/KEY: misc_feature
314 <222> LOCATION: ()..()
315 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

319 <400> SEQUENCE: 16
320 actactttct ctttctacct tcc  23

323 <210> SEQ ID NO: 17
324 <211> LENGTH: 39
325 <212> TYPE: DNA
326 <213> ORGANISM: Artificial/Unknown

328 <220> FEATURE:
329 <221> NAME/KEY: misc_feature
330 <222> LOCATION: ()..()
331 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

334 <400> SEQUENCE: 17
335 ggggggatcc ggtaccaggt gacgtcgttg ttcagccag  39

338 <210> SEQ ID NO: 18
339 <211> LENGTH: 39
340 <212> TYPE: DNA
341 <213> ORGANISM: Artificial/Unknown

343 <220> FEATURE:
344 <221> NAME/KEY: misc_feature
345 <222> LOCATION: ()..()
346 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

349 <400> SEQUENCE: 18
350 ggggggtacc ggatcctcgt agtcggccgc catgacaac  39

353 <210> SEQ ID NO: 19
354 <211> LENGTH: 31
355 <212> TYPE: DNA
356 <213> ORGANISM: Artificial/Unknown

358 <220> FEATURE:
359 <221> NAME/KEY: misc_feature
360 <222> LOCATION: ()..()
361 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

364 <400> SEQUENCE: 19
365 ggggggatcc caggtgacgt cgttgttcag c  31

368 <210> SEQ ID NO: 20
369 <211> LENGTH: 31
370 <212> TYPE: DNA
371 <213> ORGANISM: Artificial/Unknown

373 <220> FEATURE:
374 <221> NAME/KEY: misc_feature
375 <222> LOCATION: ()..()
376 <223> OTHER INFORMATION: Description of Artificial Sequence: synthetic DNA

379 <400> SEQUENCE: 20
380 ggggggtacc acggtgacgt cgttgttcag c  31

383 <210> SEQ ID NO: 21
384 <211> LENGTH: 32
385 <212> TYPE: DNA



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> to 
<223> fields of each sequence which presents at least one n or Xaa. 

VERIFICATION SUMMARY
PATENT APPLICATION: US/09/462,480

DATE:
TIME: 11:58:23

Input Set: A:\ES.txt
Output Set: N:\CRF3\12292000\I462480.raw

L:607 M:341 W: (46) "n" or "Xaa" used, for SEQ IDs: 34